Analysis of IS6110 insertion sites provide a glimpse into genome evolution of Mycobacterium tuberculosis
نویسندگان
چکیده
Insertion sequence (IS) 6110 is found at multiple sites in the Mycobacterium tuberculosis genome and displays a high degree of polymorphism with respect to copy number and insertion sites. Therefore, IS6110 is considered to be a useful molecular marker for diagnosis and strain typing of M. tuberculosis. Generally IS6110 elements are identified using experimental methods, useful for analysis of a limited number of isolates. Since short read genome sequences generated using next-generation sequencing (NGS) platforms are available for a large number of isolates, a computational pipeline for identification of IS6110 elements from these datasets was developed. This study shows results from analysis of NGS data of 1377 M. tuberculosis isolates. These isolates represent all seven major global lineages of M. tuberculosis. Lineage specific copy number patterns and preferential insertion regions were observed. Intra-lineage differences were further analyzed for identifying spoligotype specific variations. Copy number distribution and preferential locations of IS6110 in different lineages imply independent evolution of IS6110, governed mainly through ancestral insertion, fitness (gene truncation, promoter activity) and recombinational loss of some copies. A phylogenetic tree based on IS6110 insertion data of different isolates was constructed in order to understand genome level variations of different markers across different lineages.
منابع مشابه
Identification and evolution of an IS6110 low-copy-number Mycobacterium tuberculosis cluster.
A cohort of 56 patients infected with related strains of Mycobacterium tuberculosis, the S75 group, was identified in a New Jersey population-based study of all isolates with a low number of copies of the insertion element IS6110. Genotyping was combined with surveillance data to identify the S75 group and to elucidate its recent evolution. The S75 group had similar demographic and geographic c...
متن کاملDetermining the genomic locations of repetitive DNA sequences with a whole-genome microarray: IS6110 in Mycobacterium tuberculosis.
The mycobacterial insertion sequence IS6110 has been exploited extensively as a clonal marker in molecular epidemiologic studies of tuberculosis. In addition, it has been hypothesized that this element is an important driving force behind genotypic variability that may have phenotypic consequences. We present here a novel, DNA microarray-based methodology, designated SiteMapping, that simultane...
متن کاملAnalysis of sequence diversity among IS6110 sequence of Mycobacterium tuberculosis: possible implications for PCR based detection
The IS6110 belongs to the family of insertion sequences (IS) of the IS3 category. This insertion sequence was reported to be specific for Mycobacterium tuberculosis complex and hence is extensively exploited for laboratory detection of the agent of tuberculosis and for epidemiological investigations based on polymerase chain reaction. IS6110 is 1361-bp long and within this sequence different re...
متن کاملGlobal study of IS6110 in a successful Mycobacterium tuberculosis strain: clues for deciphering its behavior and for its rapid detection.
The Mycobacterium tuberculosis insertion sequence IS6110, besides being a very useful tool in molecular epidemiology, seems to have an impact on the biology of bacilli. In the present work, we mapped the 12 points of insertion of IS6110 in the genome of a successful strain named M. tuberculosis Zaragoza (which has been referred to as the MTZ strain). This strain, belonging to principal genetic ...
متن کاملThe control of copy number of IS6110 in Mycobacterium tuberculosis.
Insertion sequence (IS) elements are bacterial genes that are able to transpose to different locations in the genome. These elements are often used in molecular epidemiology as genetic markers that track the spread of pathogens. Transposable elements have frequently been described as "selfish DNA" because they facilitate their own transposition, causing damage when they insert into coding regio...
متن کامل